JH’s Blog”
LLM
Leetcode
Blog
About
Articles
Technical articles about large language models, recommendation system and general algorithms.
ExLlamaV2: The Fastest Library to Run LLMs
Quantize and run EXL2 models
Large Language Models
Nov 19, 2023
8 min
Quantize Llama models with GGUF and llama.cpp
GGML vs. GPTQ vs. NF4
Large Language Models
Sep 3, 2023
9 min
A Beginner’s Guide to LLM Fine-Tuning
How to fine-tune Llama and other LLMs with one tool
Large Language Models
Aug 27, 2023
8 min
4-bit LLM Quantization with GPTQ
Quantize your own open-source LLMs to run them on consumer hardware
Large Language Models
Jul 30, 2023
10 min
Fine-Tune Your Own Llama 2 Model in a Colab Notebook
A practical introduction to LLM fine-tuning
Large Language Models
Jul 24, 2023
10 min
Introduction to Weight Quantization
Large language model optimization using 8-bit quantization
Large Language Models
Jul 6, 2023
12 min
No matching items